JC2Q Rec^d PCI/FIQ ^ 6 AUS 2QQi 



FORM pro- 1 390 (Modified) 
(REV 11-2000) 



U.S. DEPARTMENT OF COMMERCE PATENT AND TRADEMARK OFFICE 

TitANSMlTTAL LETTER TO THE UNITED STATES 
DESIGNATED/ELECTED OFFICE (DO/EO/US) 
CONCERNING A FILING UNDER 35 U.S.C. 371 



ATTORNEY'S DOCKET NUMBER 
A32312-PCT USA 



U.S. APPLICATION NO. (IF KNOWTSl, ! 

09/913741 



EE 37 CFR 



INTERNATIONAL APPLICATION NO. 
PCT/USOO/04505 



INTERNATIONAL FILING DATE 
22 February 2000 



PRIORITY DATE CLAIMED 

19 February 1999 



TITLE OF INVENTION 

CUT AND PASTE DOCUMENT SUMMARIZATION SYSTEM AND METHOD 



APPLICANT(S) FOR DO/EO/US 

MCKEOWN, Kathleen R. andJESG, Hongyan 



i 



Applicant herewith submits to the United States Designated/Elected Office (DO/EO/US) the following items and other information: 

1 . H This is a FIRST submission of items concerning a filing under 35 U.S.C. 371 . 

2. □ This is a SECOND or SUBSEQUENT submission of items concerning a filing under 35 U.S.C. 37 1 . 

3. □ This is an express request to begin national examination procedures (35 U.S.C. 371(f)). The submission must include itens (5% 

(6), (9) and (24) indicated below. 

4. S The US has been elected by the expiration of 1 9 months from the priority date (Article 3 1 ). 

5. ^ A copy of the International Application as filed (35 U.S.C. 371 (c) (2)) 

a. S is attached hereto (required only if not communicated by the International Bureau). 

b. □ has been communicated by the International Bureau, 

c- □ is not required, as the application was filed in the United States Receiving Office (RO/US). 

□ An English language translation of the International Application as filed (35 U.S.C. 371 (c)(2)). 
U a. □ is attached hereto. 

b. □ has been previously submitted under 35 U.S.C. 154(d)(4). 
f1. S Amendments to the claims of the International Application under PCT Article 19 (35 U.S.C. 371 (c)(3)) 

a. □ are attached hereto (required only if not communicated by the International Bureau). 

b. □ have been communicated by the International Bureau. 

c. □ have not been made; however, the time limit for making such amendments has NOT expired. 

d. S have not been made and will not be made. 
■JD An English language translation of the amendments to the claims under PCT Article 19 (35 U.S.C. 371(c)(3)). 

□ An oath or declaration of the inventor(s) (35 U.S.C. 371 (c)(4)). 

□ An English language translation of the annexes of the International Preliminary Examination Report under PCT 
Article 36 (35 U.S.C. 371 (c)(5)). 



m. 
14. 
m. 

W- 

12, 

13. 
14. 
15. 
16. 
17. 
18. 
19. 
20. 
21. 
22. 
23. 



A copy of the International Preliminary Examination Report (PCT/IPEA/409). 
K A copy of the International Search Report (PCT/IS A/2 1 0). 

Items 13 to 20 below concern document(s) or information included: 

□ An Information Disclosure Statement under 37 CFR 1.97 and 1 .98. 

□ An assignment document for recording. A separate cover sheet in compliance with 37 CFR 3.28 and 3.31 is included. 

□ A FIRST preliminary amendment. 

□ A SECOND or SUBSEQUENT preliminary amendment. 

□ A substitute specification. 

□ A change of power of attorney and/or address letter. 

□ A computer-readable form of the sequence listing in accordance with PCT Rule 13ter.2 and 35 U.S.C. 1.821 - 1.825. 

□ A second copy of the published intemational application under 35 U.S.C. 154(d)(4). 

□ A second copy of the English language translation of the intemational application under 35 U.S.C. 154(d)(4), 

S Certificate of Mailing by Express Mail 

Ki Other items or information: 

Form PCT/IB/301; Form PCT/IB/304; Form PCT/IPEA/401 ; Amendment Under Article 34; Form PCT/IPEA/402; form 
PCT/IB/308; Form PCT/IB/332; Form PCT/IPEA/408; Response to Written Opinion; a postcard and a check in the 
amount of $538.00. 

Express MaO Label No.: EK839853032US Date of Deposit: August 16, 2001 



Page 1 of 2 



PCTUS1/REV03 



JC03 Rector 



16 AUG200J 



U.S. APPLICATION 



INTERNATIONAL APPLICATION NO. 
PCT/USOO/04505 



ATTORNEY'S DOCKET NUMBER 
A32312-PCT USA 



24. The following fees are submitted:. 

BASIC NATIONAL FEE ( 37 CFR 1.492 (a) (1) - (5)) : 

□ Neither international preliminary examination fee (37 CFR 1 .482) nor 
international search fee (37 CFR L445(a)(2)) paid to USPTO 
and Intemational Search Report not prepared by the EPO or JPO 



□ 



n 



□ 



International preliminary examination fee (37 CFR 1.482) not paid to 
USPTO but Intemational Search Report prepared by the EPO or JPO 

Intemational preliminary examination fee (37 CFR 1 .482) not paid to USPTO 
but intemational search fee (37 CFR l,445(a)(2)) paid to USPTO 

Intemational preliminary examination fee (37 CFR 1 .482) paid to USPTO 
but all claims did not satisfy provisions of PCX Article 33(1 )-(4) 



$1000.00 



$860.00 



$710.00 



$690.00 



Intemational preliminary examination fee (37 CFR 1.482) paid to USPTO 
and all claims satisfied provisions of PCX Article 33(1 )-(4) 



$100.00 

ENTER APPROPRIATE BASIC FEE AMOUNT = 



CALCULATIONS PTO USE ONLY 



$690.00 



Surcharge of $130.00 for furnishing the oath or declaration later than □ 20 

months from the earliest claimed priority date (37 CFR 1 .492 (e)). 



□ 30 



$0.00 



CLAIMS 



NUMBER FILED 



NUMBER EXTRA 



RATE 



Total claims 



37 



■20- 



17 



X $18.00 



$306.00 



Independent claims 



- 3 = 



$80.00 



$80.00 



Multiple Dependent Claims (check if applicable). 



□ 



$0.00 



TOTAL OF ABOVE CALCULATIONS = 



$1,076.00 



g] Applicant claims small entity status. (See 37 CFR 1 .27). The fees indicated above are 
reduced by 1/2. 



$538.00 



SUBTOTAL = 



$538.00 



!Siil>cessing fee of $130.00 for furnishing the English translation later than □ 20 
InlDnths from the earliest claimed priority date (37 CFR 1.492 (f)). 



□ 30 



$0.00 



TOTAL NATIONAL FEE = 



$538.00 



^ for recording the enclosed assignment (37 CFR 1.21(h)). The assignment must be 
alS^ompanied by an appropriate cover sheet (37 CFR 3.28, 3.3 1) (check if applicable). 



□ 



$0.00 



TOTAL FEES ENCLOSED = 



$538.00 



Amount to be: 
refunded 



charged 



A check in the amount of 



$538.00 



b. 



□ 
□ 



Please charge my Deposit Account No. _ 



_to cover the above fees is enclosed, 
in the amount of 



to cover the above fees. 



A duplicate copy of this sheet is enclosed. 

The Commissioner is hereby authorized to charge any additional fees v/hich may be required, or credit any overpayment 
to Deposit Account No, 02-4377 A duplicate copy of this sheet is enclosed. 

Fees are to be charged to a credit card. WARNING: Information on this form may become public. Credit card 
information should not be included on this form. Provide credit card information and authorization on PTO-203S. 



NOTE: Where an appropriate time limit under 37 CFR 1.494 or 1.495 has not been met, a petition to revive (37 CFR 
1.137(a) or (b)) must be filed and granted to restore the application to pending status. 

SEND ALL CORRESPONDENCE TO: 



Henry Tang 
BAKER BOTTS LLP 
30 Rockefeller Plaza 
New York, NY 10112-0228 




sSsature" 

Paul D. Ackerman 



NAME 
39,891 



REGISTRATION NUMBER 

August 16, 2001 

DATE 



Page 2 of 2 



wo 01/06408 



CUT AND PASTE DOCUMENT SUMMARIZATIO N 
SYSTEM AND METHOD 

Statement of Government Rights 

The United States Government may have certain rights to the invention 
5 set forth herein pursuant to a grant by the National Science Foundation, Contract No. 
IRI-96-198124 



Statement of Related Applications 

This application claims the benefit of United States provisional patent 
application, Serial No. 60/120,657, entitled "Summary Generation Through Intelligent 
1 0 Cutting and Pasting of the Input Document" which was filed on February 1 9, 1 999. 



Field of the Invention 

The present invention relates generally to information summarization 
and more particularly relates to systems and methods for generating a summary of a 
document using automated cutting and pasting of the input document. 

15 Background of the Invention 

The amount of information available today drastically exceeds that of 
any other time in history. With the continuing expansion of the Internet, this trend 
will likely continue well into the future. Often, people conducting research of a topic 
are faced with information overload as the number of potentially relevant documents 

20 exceeds the researcher's ability to individually review each document. To address this 
problem, information summaries are often relied on by researchers to quickly evaluate 
a document to determine if it is truly relevant to the problem at hand. 

Given the vast collection of documents available, there is interest in 
developing and improving the systems and methods used to summarize information 

25 content. For individual documents, domain-dependent template based systems and 
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domain-independent sentence extraction methods are known. Such known systems 
can provide a reasonable svunmary of a single document when the domain is known. 

Many presently available summarizers extract sentences from the 
original documents to produce summaries. However^ since the sentences are 
5 generally extracted without supporting context information, the resulting siimmaries 
can be incoherent, and in some cases, can convey misleading information. 

Therefore, there remains a need for systems and methods which can 
generate a more readable and concise summary of a document 

Summary of the Invention 

10 It is an object of the present invention to provide a system and method 

for generating a suxmnary of a document. 

It is another object of the present invention to provide a summarization 
system which extracts sentences from an input document and then transforms the 
extracted sentences such that a concise, coherent and accurate summary results, 

15 It is a further object of the present invention to provide a system and 

method for generating a summary of a set document which use automated cutting and 
pasting of the input document. 

A present method for generating a summary of an input document 
includes extracting at least one sentence from the document. The extracted sentences 

20 are parsed into components, preferably in a parse tree representation. Sentence 

reduction is performed to mark components which can be removed from the extracted 
sentences. Sentence combination is performed to mark components of two or more 
sentences which can be merged. Sentence combination also includes a paste operation 
to operate on the marked components to effect the indicated removal and combination 

25 of sentence components. 

A preferred sentence reduction operation includes measuring the 
contextual importance of the components; measuring the probabilistic importance of 
the components based on a given corpus; measuring the importance of the 
components based on linguistic knowledge; synthesizing the contextual, probabilistic 

30 and knowledge based importance measures into a relative importance score for each 
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component; and marking for removal those components with an importance score 
below a threshold value. 

The contextual importance can be measured by establishing a plurality 
of lexical links of at least one type among the components in a local context in the 
5 document and computing a context importance score according to the type, number 
and direction of lexical links associated with each component The types of lexical 
links can include repetition, inflectional variants, derivational variants, synonyms, 
hypemyms, antonyms, part-of, entailment, and causative links. 

In a preferred method, the sentence combination operation includes 
10 identifying sentence combination operations from a sentence combination subcorpus 
and developing rules regarding the application of the sentence combination 
operations. The combination rules are then applied to the extmcted sentences after 
sentence reduction to identify and merge suitable sentences from the original article. 
The sentence combination operations can be selected from the group including add 
15 descriptions, aggregations, substitute incoherent phrases, substitute phrases with more 
general or more specific information, and mixed operations. 

A present system for generating a summary of an input document 
includes an extraction module which receives the input document and extracts at least 
one sentence related to a focus of the document. A summary sentence generation 
20 module is provided, which generally includes a sentence reduction module and a 

sentence combination module. The system includes a grammatical parser operatively 
coupled to the generation module for parsing the extracted sentences into components 
in a grammatical representation. A combined lexicon and a corpus of human 
generated summaries are operatively coupled to the generation module for use by the 
25 operational modules during summary generation. 

The corpus can further include a sentence generation subcorpus and a 
sentence reduction subcorpus. The subcorpora can be generated manually or through 
the use of a decomposition module. 

Preferably, the sentence reduction module is cooperatively engaged 
30 with the combined lexicon and performs context importance processing on the 

components of the grammatical representation. Context importmice processing can 
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include establishing a plurality of lexical links of at least one type for the components 
and generating a context importance score based on the type and number of links 
associated with the components. The number and type of lexical links can vary, 
however a preferred set of lexical link types includes repetition, inflectional variants, 
5 derivational variants, synonyms, hypemyms, antonyms, part-of, entailment, and 
causative links. 

Preferably, the sentence reduction module further computes the relative 
importance of the components based on linguistic knowledge stored in the combined 
lexicon. The sentence reduction module can also be cooperatively engaged with the 

1 0 corpus and perform probabilistic importance processing on the components of the 
grammatical representation in accordance with the particular corpus used. 

The sentence combination module can be used to identify sentence 
combination operations from a sentence combination subcorpus and develop rules 
regarding the application of the sentence combination operations. The combination 

15 module applies the combination rules to the extracted sentences after sentence 
reduction to identify and merge suitable sentences from the original article. 

A decomposition module in accordance with the present application 
can be used to evaluate human generated summaries and map corresponding portions 
of the summaries to the original documents. The decomposition module indexes 

20 words in the summary and the original document. A Hidden Markov Model is then 
built based on heuristic rules to determine the probability of phrases in the summary 
sentence matching a given phrase in the original document. A Viterbi algorithm can 
then be employed to determine the best solution for the Hidden Markov Model and 
generate a mapping between summary phrases and the original document. This 

25 mapping can be used to generate, among other things, a sentence reduction subcorpus 
and a sentence combination subcorpus. Such a decomposition module can be 
operatively coupled to the corpus in the summary generation system described above. 
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Brief Description of the Drawing 

Further objects, features and advantages of the invention will become 
apparent from the following detailed description taken in conjunction with the 
accompanying figures showing illustrative embodiments of the invention, in which 
5 Figure 1 is a block diagram of the system architecture of the present 

document summarization system; 

Figure 2 is a flow chart illustrating an exemplary embodiment of a 
sentence reduction operation in accordance with the summarization system of Figxire 

1; 

1 0 Figure 3 is a pictorial diagram of an exemplary parse tree sentence 

representation; 

Figure 4 is a flow chart illustrating an exemplary embodiment of a 
sentence combination operation in accordance with the present summarization system 
of Figure 1; 

15 Figure 5 is a table illustrating exemplary sentence combination 

operations for the sentence combination operation of Figure 4; 

Figure 6 is a table illustrating exemplary sentence combination rules 
for applying the sentence combination operations of Figure 5; 

Figure 7 is a flow diagram illustrating the operation of the corpus 
20 decomposition module of Figure 1 ; and 

Figure 8 is a pictorial diagram of a Hidden Markov Model for use in a 
corpus decomposition module. 

Throughout the figures, the same reference numerals and characters, 
25 unless otherwise stated, are used to denote like features, elements, components or 
portions of the illustrated embodiments. Moreover, while the subject invention will 
now be described in detail with reference to the figures, it is done so in connection 
with the illustrative embodiments. It is intended that changes and modifications can 
be made to the described embodiments without departing from the true scope and 
30 spirit of the subject invention as defined by the appended claims. 
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Detailed Description of Preferred Embodiments 

The present summarization systems and methods generate a generic, 
domain-independent single-document summary of a received input document. Figure 
1 is a block diagram illustrating the system architecture of an exemplary embodiment 

5 of the present summarization system. Such a system can be implemented on various 
computer hardv^are, software and operating system platforms. The particular system 
components selected are not critical to the practice of the present invention. For 
example, the present system of Figure 1 , can be implemented on a personal computer 
system, such as an IBM compatible system. 

10 Referring to Figure 1, an input document 105 in computer readable 

form is applied to an extraction module 110 which determines the focus of the 
document 105 and extracts sentences from the document accordingly. A number of 
extraction techniques can be used in the extraction module 110. In a preferred 
embodiment, the extraction module 110 links words in a sentence to other words in 

15 the input document 105 through repetitions, morphological relations and lexical 

relations. An importance score can then be computed for each word in the article 1 05 
based on the number, type and direction (forward, backward) of the lexical links 
associated with the word. A sentence score can be determined by adding the 
importance score for each of the words in the sentence and normalizing the sum based 

20 on the number of words in the sentence. The sentences can then extracted based on 
the highest relative sentence scores. 

The extraction module 1 10 provides the extracted sentences 1 15 to a 
generation module 120. The generation module 120 also receives the original 
document 105 as an input. The generation module 120 further includes a sentence 

25 reduction module 135 and a sentence combination module 140. The sentence 

reduction module 135 provides a marked up parse tree as input data for the sentence 
combination module 140, which generates and ou^uts the summary sentences. 

The generation module 120 is operatively coupled to a corpus of 
himian-written summaries 165, a lexical database 170, and a combined reusable 

30 lexicon 175. 
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The corpus 165 generally includes a broad collection of hiunan- 
generated summaries as well as the corresponding original documents. The corpus 
165 can also include a sentence reduction subcorpus 165a and a sentence combination 
subcorpus 165b which can be generated manually or through a decomposition 
5 module. The sentence reduction subcorpus 165a includes entries of sentence pairs 
linking an original sentence to a himian reduced sentence. The sentence combination 
subcorpus 165b includes mappings from human combined sentences to two or more 
original sentences. 

A suitable exemplary corpus 165 was generated using 
10 Communications-related Headlines^ a free daily online news service provided by the 
Benton Foundation (http://vww.benton.org). The articles from this particular service 
are communication related, but the topics involved are very broad, including law, 
company mergers, new technologies, labor issues and so on. Of course, other sources 
of document summaries can also be used to generate a suitable corpus. To insure that 
1 5 the resulting corpus is somewhat generic, the articles from the selected source should 
not possess a particular writing style. Thus, preferred sources feature articles from 
multiple sources or articles from various sections of one or more source. A suitable 
corpus 165 was generated in foxir major steps. First, human-written, single document 
summaries are received from the source. Second, the original documents are retrieved 
20 and correlated to the respective summary. The retrieved documents are then 
^*cleaned" by removing irrelevant material such as indexes and advertisements. 
Finally, the quality of the correspondence between the summary and the original 
document is verified. The cleaning and verification processes are generally performed 
manually. The sentence reduction subcorpus i65a and sentence combination 
25 subcorpus 165b entries were generated by the decomposition module 1 85, the 
operation of which is explained below. 

The lexical database 170 can take the form of the WordNet database, 
which is described in the article "WordNet: A lexical Database for English", by G.A. 
Miller, Communications of the ACM, Vol. 38, No. 11. pp. 39-41, November 1995. A 
30 suitable embodiment of the combined lexicon 175 can be constructed by combining 
multiple, large-scale resources such as WordNet, the English Verb Classes and 
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Alternations (EVCA) database, the COMLEX syntax dictionary and the Brown 
Corpus tagged with WordNet senses. The combined lexicon 175 can be formed by 
encoding the EVCA database with COMLEX compatible syntax and merging the 
EVCA into the COMLEX database. This results in each verb in the combined 
5 lexicon 175 being marked with a list of subcategorizations and alternate syntactic 

patterns. Preferably, WordNet is added to the EVCA/COMLEX combination to refine 
the syntactic information and provide additional lexical information to the lexicon 
175. 

The generation module 120 is also cooperatively coupled to natxiral 

10 language processing (NLP) tools such as a syntactic parser 180 and a co-reference 
resolving module 190 which can include anaphora resolution. These tools can be 
software modules which are called by the generation module 120. A suitable 
syntactic parser 180 is the English Slot Grammar (ESG) parser available from 
International Business Machines, Inc. A suitable co-reference resolving module 190 

15 is the Deep Read system, available from Mitre, Inc. 

Figure 2 is a flow diagram further illustrating the operation of the 
sentence reduction module 135. The reduction module 135 receives extracted 
sentences 1 1 5 as input (step 205). The reduction module invokes the parser 1 80 to 
grammatically parse the extracted sentences 1 1 5 and generate a parse tree 

20 representation of the sentences (step 2 1 0). In step 215 contextual importance is 
determined by detecting lexical Imks among words in a local context and then 
computing an importance score based on the number, type and direction of lexical 
links detected. The context processing step 215 generates an importance score for 
each node in the parse tree indicating the relative importance of the nodes to the focus 

25 of the input document 1 05 , 

The nimiber, type and direction (forward, backward) of lexical links 
used in the practice of the present invention may vary. An empirical study has 
demonstrated that the following nine lexical relation types provide a meaningful 
representation of contextual importance: (1) repetition, (2) inflectional variants, (3) 

30 derivational variants, (4) synonyms, (5) hypemyms, (6) antonyms, (7) part-of, (8) 
entailment (for example: Mil ^ die), and (9) causative (for example: eat — ^ chew). 
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Inflectional variants (2) and derivational variants can be derived from the CELEX 
database content, available from the Centre for Lexical Information, Max Planck 
Institute for Psycholinquistics, Nijmegen, which can be in the combined lexicon 175. 
The other lexical relations can be extracted using the separate lexical database 170, 
5 such as WordNet, To frame the local context of a word, a number of sentences before 
and after the current sentence location are evaluated for the presence of lexical links. 
The nimiber of sentences selected for this operation involves balancing the level of 
contextual depth to the amount of processing overhead. Using the five sentences 
before and the five sentences after the current sentence has been found to provide 

10 reasonable local context without incurring excessive processing overhead. 

After the lexical links have been identified (step 215 a), an importance 
score for each word in the extracted sentences can be calculated (step 215 b). Lexical 
links from the current sentence to subsequent sentences are referred to as forward 
links and those from the current sentence to preceding sentences are referred to as 

15 backward links. The importance score, referred to as the context weight, can be 
computed as follows: 

9 

1) ForwardWeight(w) = S {WixLi(w)) 

i = 1 

9 

2) BackwardWeightiw) = Z {WixBnumi{w)) 

i = 1 

3) TotalWeight(w) = ForwardWeight(w) -f BackwardWeight{w) 

mzx(ForwardWeight(w\ Backward {weight (w)) 

4) Ralio{w) = — — 

TotaIweighi(w)) 

5) ContextWeight = Ratio{w)xTotaiWeight{w) 



where ForwardWeight(w) computes the weight of forward links, BackwardWeight(w) 
computes the weight of backward links, TotaIWeight(w) represents the sum of all 
links and Ratio(W) computes a weight for the location of the word. To compute the 
20 weight of various lexical links, each type of link is assigned a weighted value 

according to its relative importance. For example, the nine lexical relations set forth 
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above were presented in descending order of importance and accordingly can be 
assigned linearly decreasing weights such as (1,0.9, 0.8, 0.7, 0.6, 0.5, 0.4, 03, 0.2). 
The value of Ratio(w) represents the value assigned based on the location of the word 
in the original document. For example, when a sentence introduces a topic or ends a 
5 topic, it is considered more important and the components of those sentences will be 
assigned a relatively higher location value. 

The use of various types of lexical relations improves the relatedness 
of a word to the main topic. Although simple relations like repetition and synonymy 
can be used to determine a measure of contextual importance, these surface relations 
1 0 are generally unable to detect more subtle connections between words. 

Folio vdng context processing (step 215) the reduction module 13 5. can 
rf perform interdependency processing using a probability analysis based on the corpus 
0 165 of human- written reduction based sentences. Such an analysis can indicate the 
jj degree of correlation between components in a sentence, such as the relationship 
J 5 between a verb and its subclause, 

S ^ The probability computation can be performed based on parse trees 

□ using probabilities to indicate the degree of correlation between a parent node and its 
3: child nodes in the parse tree. Figure 3 illustrates an exemplary fragment of a parse 

tree used to explain tiie operation of the probability computation. In Fig. 3, The main 
niO verb "give" is the parent nodes 300, and it has four children nodes: subclause conjunct 
305, subject 310, indirect object 3 15 and object 320, respectively. The parse tree can 
also include further levels below the children nodes, such as nodes ndet 325 and adjp 
330 below child node obj 320 and nodes Iconj 335 and rconj 340 below node adjp 
330, respectively. 

25 To measure the interdependency between the verb give and its 

subclause 305, the probability that the subclause is removed when the verb is give, 
can be represented by PROB{"when_clause is removed*\verb = give) . This conditional 
probability is transformed using Bayes's rule to: 

. . I I=roBC'v = give'f'when daLBerefTiDwecr')*FFDB(|"when daLBeisremovecf') 

FTOBC when_dau9e is renrowecTf v = give") ~ = — ~ : — : = 

[ RHDB(verb = give) 
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In a similar fashion, the probabilities that a clause will be reduced or remain 
unchanged can be calculated in a similar manner. 

The probability associated with the other child nodes from the current 
root node is calculated in a similar manner. After the probabilities for each of the first 

5 level child nodes is calculated, each of the child nodes in the current level of the tree 
is then treated as a parent node and the process is repeated through each descending 
level of the parse tree until every parent-child node pair has been considered. The 
probabilities for the corpus 165 can be calculated and stored in a look-up table which 
is used when a reduction module 135 is run. 

10 The context processing of step 215 and probability processing of step 

220 provide a relative ranking of sentence components. However, this ranking does 
not necessarily provide a measure of which components be included to provide a 
granomatically correct summary sentence. Thus, preferably, after the probability 
analysis of step 220, reduction processing based on linguistic knowledge is 

1 5 performed (step 225). In this operation, the reduction module 135 works in 
cooperation with the combined lexicon 175. 

The linguistic knowledge processing step 225 operates with the 
combined lexicon 1 75 to evaluate the parse tree for each extracted sentence 115 and 
determine which children nodes are essential to maintain the grammatical correctness 

20 of the component represented by the parent node. Linguistic judgments are identified 
in the parse tree by assigning a binary tag to each node in the parse tree. The value of 
a tag is either essential or reducible, indicating whether or not a node is indispensable 
to its parent node. For example, referring to Figure 3, the lexicon 175 will indicate 
that the verb give needs a subject and two objects. Thus the child nodes subj 310, iobj 

25 315 and obj 320 can be marked as essential. In this case, the child node subclause 305 
is then rendered non-essential and will be marked as reducible. The lexicon 175 can 
also include collocations, such as consist of or replace .... mth which prevents 
removal of indispensable components. 

Once the linguistic knowledge processing is applied in step 225, a 

30 reduction operation (step 230) can take place. The reduction operation process can be 
viewed as a series of decision making steps along the edges of a parse tree. Beginning 
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with the root node of the parse tree, the immediate child nodes are evaluated to 
determine which child nodes can be removed. A child node can be removed if three 
conditions are satisfied. The first condition is that the component is not a local focus. 
To determine whether a component is a local focus, the ratio of the context 

5 importance score (step 215b) of the child node to that of the root node is calculated. 
The child node is then considered unimportant if the calculated ratio is smaller than a 
threshold value. The second condition is that the corpus probability value (step 220) 
indicating that the special syntactic component of the root is removed is higher than a 
threshold. The final condition is that the linguistic analysis in step 225 indicates that 

10 the child node as reducible. 

When the conditions to remove a child node are satisfied, the child 
node is tagged as "removable" and processing on that branch of the tree terminates. 
For the child nodes which are retained, the lower levels of the parse tree are evaluated 
by repeating this process in a similar mamer through the tree. The reduction 

1 5 operation step 230 is complete when there are no more nodes to consider. This also 
concludes processing of the sentence reduction module and results in the parse trees 
being marked with those components which can be removed or altered by the 
subsequent paste module 150 operation. 

Following processing by the sentence reduction module 135, 

20 processing by the sentence combination module 140 is performed. The operation of 
the sentence combination module 140 is fiirther illustrated in the flow chart of Figure 
4. 

Using the sentence combination subcorpusl65b, the sentence 
combination module evaluates the extracted sentence to identify applicable sentence 
25 combination operations (step 410). Figure 5 is a table illustrating combination 
operations such as: add descriptions 510, aggregations 515, substitute incoherent 
phrases 520, substitute phrases with more general or more specific information 525 
and mixed operations 530. 

From the sentence combination subcorpus 165b, sentence combination 
30 rules are also established to determine whether and how the sentence combination 
operations of step 410 will take place (step 415). The result is a set of sentence 
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combination rules 420, such as those set forth in Figure 6. The rules illustrated in 
Figure 6 are exemplary and non-exhaustive. These sentence combination rules 420 
were determined empirically by manual inspection of the sentence combination 
subcorpus 165b. Using the input article 105 and the extracted sentences reduced by 

5 the sentence reduction module 135 the sentence combination module 140 in 
cooperation with the co-reference resolution module 190 applies the sentence 
combination rules 420 (step 425), The result of step 425 is that the parse trees of the 
sentences being combined are appropriately tagged to effect the sentence combination. 
The combination operation is then realized in step 430 using a tree adjoining grammar 

10 (TAG) formalism, as described by A. Joshi, "Introduction to Tree- Adjoining 

Granamars," in Mathematics of Language, John Benjamins, Amsterdam, 1987. In this 
way, the sentence combination module 140 performs a paste operation on the marked 
parse trees and generates a summary sentence. 

The document summary is generated by combining the smnmary 

15 sentences. The most straight forward combination is to maintain the order of 

sentences as they were extracted, however, other sequencing arrangements can also be 
employed. 

As noted above in connection with Figure 1^ the corpus decomposition 
module 185 operates on the corpus 165 to generate the sentence reduction subcorpus 

20 165a and the sentence combination subcorpus 165b, The decomposition module 185 
generally operates to evaluate the human written summaries in the corpus 165, 
compare the summary sentences to the original document, determine if a svmmiary 
sentence was generated by a cut and past operation and identify where the components 
of tiie summary sentences were taken from in the original docimients. The operation 

25 of the decomposition module 185 is illustrated in the flow diagram of Figure 7. 

Referring to Figure 7, the decomposition module 185 uses the human- 
generated summary and original document as inputs to an indexing operation (step 
705). During indexing, each word in the original document is indexed according to its 
positions in the original document. A convenient way of referencing these 

30 occimrences is by sentence number and word number in the original docxmient. 
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To evaluate the index of words, a set of hexiristic rules is developed by 
manual inspection of the corpus 165, Such inspection reveals that hxunan-generated 
summaries often include one or more of six operations: sentence reduction, sentence 
combination, syntactic transformation, lexical paraphrasing, 

5 generalization/specification, and content reordering. The heuristic rules can be 
represented using a bigram probability PROB (PF^ = (5^, fT^) | = (5;, Wj)) 
(abbreviated as PROB(W2\ Wj) in the foliov^ng discussion). The probability values can 
be assigned in the foUoMdng manner: 

•/F {(Sj = S^and{Wj =W2- 1)) (i.e., the words are in two adjacent positions 

10 in the document), THENPROBiW^] W^) is assigned the maximal value, PL(Rule: Two 
adjacent words in the summary are most likely to come from two adjacent words in 
the docimient.) 

mIF ((5; = S2)and(Wj < PF^ - 1)), THENPROBiW^l Wj) is assigned the second 
highest value, P2. (Rule: Adjacent words in the summary are highly likely to come 
1 5 from the same sentence in the document, retaining their relative precedent relation, as 
in sentence reduction. This rule can be further refined by adding restrictions on 
distance between words.) 

•/F ((S, = S2)and(Wj > W^)), THEN PROBiW^] Wj) is assigned the third highest 
value, P3. (Rule: Adjacent words in the simrnaary are likely to come from the same 
20 sentence in the document but reverse their relative orders, such as in the case of 
sentence reduction with syntactic transformations.) 

•/F(S^ - CONST <S, < S2X THENPR0B{W2\ Wj) is assigned the fourth 
highest value, P4. (Rule: Adjacent words in the summary can come from nearby 
sentences in the document and retain their relative order, such as in sentence 
25 combination. CONST is a small constant such as 3 or 5.) 

mIF {S2 <S;<S2 + CONST), THEN FROBiW^ \ Wj) is assigned the fifth 
highest value, P5. (Rule: Adjacent words in the summary can come from nearby 
sentences in the document but reverse their relative orders.) 

mIF(\S2'Sj\> CONST) THENPROB^W^ \ Wj) is assigned a small value, P6. 
30 (Rule: Adjacent words in the summary are not very likely to come from sentences far 
apart.) 
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Based on the above heviristic principles, a Hidden Markov Model can 
be generated, such as is illustrated in Figure 8 (step 710). The nodes in the Hidden 
Markov Model represent possible positions in the document, and the edges ou^ut the 
probability of going from one node to another. This Hidden Markov Model is used in 
5 finding the most likely position sequence in a subsequent processing operation. 

Assigning values to P1-P6 is performed empirically. For example, the maximal value 
can be assigned 1 and others are assigned evenly decreasing values 0.9, 0.8 and so on. 
The order of the above rules is based on the empirical observations on a particular set 
of summaries. These values, however, can be adjusted or even trained for different 
10 corpora. 

A Viterbi algorithm can be used to evaluate the Hidden Markov 
Model and find the most likely sequence of words incrementally (step 715). The 
Viterbi algorithm first finds the most likely sequence for {WordjWord^), for each 
possible position of Word2. This information is then used to compute the most likely 

1 5 sequence for {Word, Word.Word,), for each possible position of Word^. The process 
repeats until all the words in the sequence have been considered. 

After evaluation by the Viterbi algorithm, post-editing operations can 
be used to cancel mismatches that occur in the corpus analysis. The resuh is that 
summary sentences are matched to the corresponding phrases in the document. Once 

20 the summary sentences are so matched, it is a simple endeavor to sort the various 

matchings to one of the sentence reduction subcorpus 165a and sentence combination 
subcorpus 165b. In addition, the decomposition module 185 can be used as a stand 
alone tool, apart from the rest of the present summary generation system, to perform 
various summary analysis operations. 

Although the present invention has been described in connection with 
specific exemplary embodiments, it should be understood that various changes, 
substitutions and alterations can be made to the disclosed embodiments without 
departing from the spirit and scope of the mvention as set forth in the appended 
claims. 
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CLAIMS 



1 . A system for generating a summary of an input document comprising: 
an extraction module, the extraction module receiving the input 

document and extracting at least one sentence related to a focus of the document; 

a summary sentence generation module operatively coupled to the 

extraction module; 

a grammatical parser operatively coupled to the generation module for 
parsing the extracted sentences into components in a granmiatical representation; 

a combined lexicon operatively coupled to the generation module; and 
a corpus of himian generated summaries operatively coupled to the 
generation module. 



2. The system for generating a summary of an input document of claim 1 , 
wherein the generation module further comprises a sentence reduction module. 

3 . The system for generating a sxmmaary of an input document of claim 2, 
wherein the sentence reduction module is cooperatively engaged with the corpus and 
performs probabilistic importance processing on the components of the grammatical 
representation in accordance with the corpus. 

4. The system for generating a summary of an input document of claim 3, 
wherein the sentence reduction module is cooperatively engaged with the combined 
lexicon and performs context importance processing on the components of the 
grammatical representation. 



5. The system for generating a summary of an input document of claim 4, 
wherein the context importance processing includes establishing a plurality of lexical 
links of a least one type for the components and generating a context importance score 
based on the type and number of links associated with the components. 
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6. The system for generating a summary of an input document of claim 5 , 
wherein the sentence reduction module further computes the relative importance of 
the components based on linguistic knowledge stored in the combined lexicon. 



7. The system for generating a sxmmiary of an input document of claim 1 , 
wherein the generation module further comprises a sentence combination module. 

8. The system for generating a summary of claim 7, wherein the sentence 
combination module is operatively coupled to the corpus and wherein the sentence 
combination module: 

identifies at least one sentence combination operation; 
establishes at least one rule for applying the sentence combination 

operation; and 

applies the at least one rule to combine at least two extracted 

sentences. 



9. The system for generating a summary of claim 8, wherein the at least one 
1 5 sentence combination operation is selected from the group consisting of add 

descriptions, aggregations, substitute incoherent phrases, substitute phrases with more 
general or more specific information, and mixed operations. 

1 0. The system for generating a summary of claim 9, wherein the at least one rule 
to combine extracted sentences includes replacing a partial name phrase with a full 

20 name phrase. 



1 1 . The method of generating a summary of claim 1 0, wherein the at least one rule 
to combine extracted sentences includes determining if two sentences having a 
common subject are proximate and whether at least one sentence is marked for 
reduction then removing the subject of the second sentence and combining with the 
25 first sentence using the connective "and." 
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12. The system for generating a summary of an input document of claim 1 , 
wherein the generation module further comprises a sentence reduction module and a 
sentence combination module. 

13. The system for generating a suxxmiary of an input document of claim 12, 

5 wherein the sentence reduction module is cooperatively engaged with the combined 
lexicon and performs context importance processing on the components of the 
grammatical representation. 

14. The system for generating a summary of an input document of claim 13, 
wherein the context importance processing includes establishing a plurality of lexical 

10 links of a least one type for the components and generating a context importance score 
based on the type and number of links associated with the components. 

15. The system for generating a summary of an input dociament of claim 14, 
wherein the sentence reduction module further computes the relative importance of 
the components based on linguistic knowledge stored in the combined lexicon. 

15 16. The system for generating a summary of an input document of claim 1 5 , 

wherein the sentence reduction module is cooperatively engaged with the corpus and 
performs probabilistic importance processing on the components of the grammatical 
representation in accordance with the corpus. 

1 7. The system for generating a summary of an input document of claun 12, 
20 wherein the sentence combination module is operatively coupled to the corpus and 
wherein the sentence combination module: 

identifies at least one sentence combination operation; 
establishes at least one rule for applying the sentence combination 

operation; and 

25 applies the at least one rule to combine at least two extracted 

sentences. 
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18. The system for generating a summary of claim 17, wherein the at least one 
sentence combination operation is selected from the group consisting of add 
descriptions, aggregations, substitute incoherent phrases, substitute phrases with more 
general or more specific information, and mixed operations. 

5 19. The system for generating a summary of claim 1 8, wherein the at least one 
rule to combine extracted sentences mcludes replacing a partial name phrase with a 
full name phrase. 

20. The method of generating a summary of claim 19, wherein the at least one rule 
to combine extracted sentences includes determining if two sentences having a 

1 0 common subject are proximate and whether at least one sentence is marked for 

reduction then removing the subject of the second sentence and combining with the 
first sentence using the connective "and." 

21. The system for generating a summary of an input document of claim 1 , further 
comprising a decomposition module operatively coupled to the corpus, the 

1 5 decomposition module analyzing the corpus and generating a sentence reduction 
subcorpus and a sentence combination subcorpus. 

22. A method of generating a summary of an input document comprising: 

extracting at least one sentence from the document; 
parsing the at least one sentence into components; 
20 performing a sentence reduction operation to mark components which 

can be removed from the sentence; 

performing a sentence combination operation to mark components of at 
least two sentences which can be merged; and 

operating on the marked components to effect the indicated removal 
25 and combination of sentence components. 
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23. The method of generating a summary of claim 22, wherein the sentence 
reduction operation comprises: 

measuring the contextual importance of the components; 

measuring the probabilistic importance of the components based on a 

5 given corpus; 

measuring the importance of the components based on linguistic 

knowledge; 

synthesizing the contextual, probabilistic and knowledge based 
importance measures into a relative importance score for each component; and 
1 0 marking those components having an importance score below a 

threshold value for removal. 

24. The method of generating a summary of claim 23, wherein the contextual 
importance is measured by: 

identifying a plurality of lexical links of at least one type among the 
15 components in a local context in the docximent; and 

computing a content importance score according to the type and 
number of lexical links associated with each component. 

25. The method of generating a summary of claim 24, wherein the at least one type 
of lexical links are selected from the group consisting of repetition, inflectional 

20 variants, derivational variants, synonyms, hypemyms, antonyms, part-of, entailment, 
and causative links. 

26. The method of generating a summary of claim 23, wherem the probabilistic 
importance score is determined based on a corpus of human-written summaries. 

27. The method of generating a summary of claim 23, wherein the linguistic 
25 knowledge operation includes the use of a combined lexicon. 
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28 , The method of generating a summary of claim 22, wherein the sentence 
combination operation further comprises: 

identifying at least one sentence combination operation; 

establishing at least one rule for applying the sentence combination 

5 operation; and 

applying the at least one rule to combine at least two extracted 

sentences. 



29. The method of generating a summary of claim 28, wherein the at least one 
sentence combination operation is selected from the group consisting of add 

10 descriptions, aggregations, substitute incoherent phrases, substitute phrases with more 
general or more specific information, and mixed operations. 

30. The method of generating a summary of claim 28, wherein the at least one 
rule to combine extracted sentences includes replacing a partial name phrase with a 
full name phrase. 

1 5 3 1 . The method of generating a summary of claim 28, wherein the at least one rule 
to combine extracted sentences includes determining if two sentences having a 
common subject are proximate and whether at least one sentence is marked for 
reduction then removing the subject of the second sentence and combming with the 
first sentence using the coimective "and." 

20 32. A method of identifymg correspondence between phrases in a sentence in a 
summary and phrases in the original document corresponding to the summary 
comprising: 

establishing a plurality of heuristic rules for identifying a cut and paste 
summarization operation; 
25 building a probability model based on the heuristic rules; and 

calculating the best solution of the probability model to map a 
correspondence between the summary phrases and the original phrases. 
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33 . The method of claim 32, wherein the probabiUty model is a Hidden Markov 
Model. 

34. The method of claim 33, wherein a Viterbi algorithm is employed to calculate 
the best solution. 

5 35. A corpus for a summarization system comprising: 
a plurality of documents; 

a plurality of human generated summaries associated with the plurality 

of documents; 

a sentence combination subcorpus; and 
1 0 a sentence reduction subcorpus, 

36. The corpus of claim 35, wherein the sentence combination subcorpus includes 
at least one mapping between a sximmarj^ sentence and at least two original sentences 
containing phrases in the summary sentence. 



37. The corpus of claim 35, wherein the sentence reduction subcorpus includes at 
least one sentence pair, each sentence pair having a summary sentence and a 
corresponding original sentence. 
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RULE 1: 

IF: ((a person or an organization is mentioned the first tinne) and 
(the full name or the full description of the person or the 
organization exists somewhere in the original article but is 
missing in the summary)) 

THEN: replace the phrase with the full name plus the full 
description 

RULE 2: 

IF: ((two sentences are close to each other in the original article) 

and (their subjects refer to the same entity) and (at least one 

of the sentences is the reduced form resulting from sentence 
reduction)) 

THEN: merge the two sentences by removing the subject in the 
second sentence, and then combining it with the first sentence 
using connective "and". 



FIG. 6 



SUBSnrUTE sheet (rule 28) 
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(37 CFR 1.16(e)) 
required) 



faration 
ibmitted 
with Initial 
Filing 



Attorney Docket Number 



First Nanrted Inventor 



A32312 PCT-USA 



Kathleen R. McKeown 



COMPLETE IF KNOWN 



Application Number 



Filing Date 



Group Art Unit 



Examiner Name 



09/913746 



August 16, 2001 



(Not yet assigned) 



Stephen Hong 



As a below named inventor, 1 hereby declare that: 

My residence, mailing address, and citizenship are as stated below next to my name. 

I believe i am the original, first and sole inventor (if only one name is listed below) or an original, first and joint inventor (if plural 
names are listed below) of the subject matter which is claimed and for which a patent is sought on the invention entitled: 



CUT AND PASTE DOCUMENT SUMMARIZATION SYSTEM AND METHOD 



(Title of the Invention) 



the specification of which 
□ is attached hereto 



0 



OR 



was filed on (MM/DD/YYYY) 



08/16/2001 



as United States Application Number or PCT International 



Application Number 09/913,746 



and was amended on (MM/DDATYY) 



(if applicable). 



I hereby state that I have reviewed and understand the contents of the above identified specification, including the claims, as 
amended by any amendment specifically referred to above. 

1 acknowledge the duty to disclose information which is material to patentability as defined in 37 CFR 1.56, including for continuation- 
in<)art applications, material information which became available between the filing date of the prior application and the national or 
PCT international filing date of the continuation-in-part application. 

1 hereby claim foreign priority benefits under 35 U.S.C. 119(a)-(d) or (f), or 365(b) of any foreign application(s) for patent, inventor's 
or plant breeder's rights certificate(s), or 365(a) of any PCT international application which designated at least one country other 
than the United States of America, listed below and have also identified below, by checking the box, any foreign application for 
patent, inventor's or plant breeder's rights certificate(s), or any PCT international application having a filing date before that of the 
application on which priority is claimed. 



Prior Foreign Application 
Number(s) 



Country 



Foreign Filing Date 
(MM/DD/YYYY) 



Priority 
Not Claimed 



Certified Copy Attached? 
YES NO 



□ 
□ 
□ 
□ 



□ □ 

□ □ 

□ □ 

□ □ 



□ 



Additional foreign application numbers are listed on a supplemental priority data sheet PTO/SB/02B attached hereto: 
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States provisional application(s) listed below: 



Provisional Application Number 


Filing Date 











































Claim for Benefit of Earlier U.S,/PCT Application(s) under 35 U.S.C. 120 

(complete this part only if this is a divisional continuation or C-I-P application) 
I hereby claim the benefit under Title 35, United States Code, § 120 of any United States 
application(s) or PCT international application(s) designating the United States of America that 
is/are listed below and, insofar as the subject matter of each of the claims of this application is 
not disclosed in the prior application(s) in the manner provided by the first paragraph of Title 35, 
United States Code § 1 12, 1 acknowledge the duty to disclose information as defined in Title 37, 
Code of Federal Regulations, Section 1.56 which occurred between the filing date of the prior 
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Filing Date 


Status (patented, pending, abandoned) 
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February 22, 2000 


Pending 
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DECLARATION — Utility or Design Patent Application [ 



6^ 



Direct at! coirespgndfticeto: g| uEl 




^ Correspondence address below 



Name 



Address 



City 



Country 



Stats 



Telephone 



ZIP 



I hereby dQcl^m that all statements inade herein of my own knowledge are true 9fxl that all statements made on infomralion and belief 
ar^ bdisvad to be taie; and farmer that the?e statements were made wtHi the knowledge that willful talse statements and the like so 
made are punisHabte by fine or impneonment or both, under 18 U.S^C 1001 and that such wiiHul fdlso statements may jeopardize th& 
validity of the application oc any patent issued thsreon. * 



NAME OF SOLE OR FIRST INVENTOR : 



I I A petition has been filed for this unsigned inventor 



Kathleen 

Givbh Name 

(first and middle Of any]) 



JlcKeown 

Family Name 
orSumiimo 



Invenfor^s 
Signature 



Wayne 



Residence: city 



Nj MS 



U,SA 
Country 



Date 



U.S-A. 
Citizenship 



iwailinfi Address 



20 Prospect Road 



Wayne 
Ci^ 



NAME OF SECOND INVENTOR: 



NJ 



07470 
ZIP 



U.SA 
Country 



n A petition h95 been filed for this unsigned Inventor 



(first ertd middle [if any]) 



JIng 
^SfruP^Tbrne 
or Surname 



Inventors 
Signature 



Residence: Ci^ 



Slate 



U.SA 
Country 



Dat6 



Peoples Republic of Chine 

Citizenship 



Mailing Address 



»6 21 Wogt 112th CtnGQl Apt. 73C (3>\L&S D^. /IPT^ 



City 



state 



ZIP 



U-SA 
Country 



I I Additional Inventors are being named on the supplemental Additional ltivcmor<s} shBBt(s) PTQ/SB/02A attached hereto. 
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Application Number 


09/913,746 




RpngData 


August 16, 2001 


if POWER OF ATTORNEY OR 


First NwnQd Inventor 


Kathleen R. McKeown 


AUTHORIZATION OF AGENT 


Group Aft Unit 


(Not vet assigned) 




Examiner Nsma 


Stephen Hong 




Attomoy Docket Number 


A32312 PCT-USA j 



1 hereby appoint 

Ej Practitioners at Customer Number 
OR 



21003 



Place Customer 
Number Bar Code 
L^bei here 



Name 


Realstration Number 



















as my/our attomey(s) or agent(s) to prosecute the application identifled above, and to transact ail 
business In the United States Patent and Trademark Oifioe connected therewith. 



Please change the correspondence address for the above-identified application to: 
[✓I The above-mentioned Customer Number. 

OR 



□ 



Firm or 

Indfvlduai Namf? 



Address 



Address 



City 



Country 



Telephony 



State 



Zip 



Fax 



[ am the; 

Applicant/Inventor. 

Q Assignee of record of the enfine interest. See 37 CFR 3.71 . 

Si&tQmQnt under 37 CFR 3,73(b) is endosed- (form PT0/SBi96). 



Name 



Signature 



Date 



SIGNATURE of Applicant or Assignee of Record 



Hongyan Jing 



NOTE: Signatures of aS the hventors or aesigneas of record of the entire interest or tneir repr^enteftivd(&) are required. Submit multiplB 
forms if mora than one signature is required, see below*. 



□ *Totalof_ 



_tonn9 ere ^ubmittGd. 



r 
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as my/our attorney(s) or agent(s) to prosecute the application identified above, and to transact all 
business in the United States Patent and Trademark Office connected therewith. 



Please change the correspondence address for the above-identified application to: 
The above-mentioned Customer Number. 



OR 



1 1 Firm or 
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Address 
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State 


Zip 


Country 
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Applicant/Inventor. 



I I Assignee of record of the entire interest. See 37 CFR 3.71 . 

Statemer)t under 37 CFR 3.73(b) is enclosed. (Form PTO/SB/96). 



SIGNATURE of Applicant or Assignee of Record 



Name 


Kathleen R. McKeown 


Signature 




Date 




NOTE: Signatures of all the inventors or assignees of record of the entire interest or their representative{s) are required. Submit multiple 
forms if more than one signature is required, see below*. 


□ ^otal of 


forms are submitted. 



